CS 229 Project Report: San Francisco Crime Classification

نویسندگان

  • Charles Hale
  • Feng Liu
چکیده

Different machine learning approaches were conceptualized and implemented for predicting the probabilities of crime categories for crimes reported in San Francisco. The crimes records used in the research are downloaded from a competition on Kaggle. A Bayesian model, a mixture of Guassians model (stratified and unstratified), and logistic regression are implemented. A satisfactory result was achieved with Bayesian model, corresponding to a Kaggle leaderboard of 852th out of 2335 teams.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Crime Prediction in San Fransisco

In June 2015, Kaggle began a competition named “San Francisco Crime Classification”[8], ending in June 2016. The competition’s dataset caught our attention due the subject being very tangible, with crime being at the forefront of modern media and to San Francisco being culturally significant due to its current tech industry. The dataset is also described by geographic and temporal features, the...

متن کامل

San Francisco Crime Classification

San Francisco Crime Classification is an online competition administered by Kaggle Inc. The competition aims at predicting the future crimes based on a given set of geographical and time-based features. In this paper, I achieved a an accuracy that ranks at top %18, as of May 19th, 2016. I will explore the data, and explain in details the tools I used to achieve that result.

متن کامل

RoboCop: Crime Classification and Prediction in San Francisco

In this paper, we employ machine learning and other statistical techniques to the problems of classifying and predicting crimes in San Francisco. Drawing upon existing research in the field to approach these two problems, we employ Random Forest and VAR(p) models, respectively. For the classification problem, our results across all 39 crime categories demonstrate the difficulty of the fully-spe...

متن کامل

San Francisco Crime Classification 2015

We aim to classify the type of crimes committed within San Francisco, given the time and location of a criminal occurrence. This study is important and beneficial. Using data mining approaches, we can predict the location, type and time of criminal occurrences in the city. We also explore some interesting questions, for example, if more crimes occur on certain days of the week or certain times ...

متن کامل

A Computational Model for Multi - Instrument Music Transcription CS 229 Final Project Report , Autumn 2013

The aim of our project is to build a model for multi-instrument music transcription. Automatic music transcription is the process of converting an audio wave file into some form of music notes representations. We propose a two-step process for an automatic multiinstrument music transcription system including timbre classification and source separation using probabilistic latent component analysis.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016